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Abstract —The capacity of the linear time-varying (LTV) chan¬ 
nel, a continuous-time LTV filter with additive white Gaussian 
noise, is characterized hy waterfllling in the time-frequency 
plane. Similarly, the rate distortion function for a related non¬ 
stationary source is characterized hy reverse waterfilling in the 
time-frequency plane. Constraints on the average energy or on 
the squared-error distortion, respectively, are used. The source 
is formed by the white Gaussian noise response of the same LTV 
filter as before. The proofs of both waterfilling theorems rely 
on a Szego theorem for a class of operators associated with the 
filter. A self-contained proof of the Szego theorem is given. The 
waterfilling theorems compare well with the classical results of 
Gallager and Berger. In the case of a nonstationary source, it 
is observed that the part of the classical power spectral density 
is taken by the Wlgner-Ville spectrum. The present approach 
is based on the spread Weyl symbol of the LTV filter, and is 
asymptotic in nature. For the spreading factor, a lower bound is 
suggested by means of an uncertainty inequality. 

Index Terms —Channel capacity, linear time-varying (LTV) 
channel, nonstationary source, rate distortion function, Szego 
theorem, time-frequency transfer function, uncertainty. 


I. Introduction 

T he characterization of the capacity of continuous-time 
channels with an average power constraint by waterfilling 
in the frequency domain, going back to Shannon IT], has 
been given by Gallager for linear time-invariant (LTI) 
channels in great generality. At least since the advent of 
mobile communications, there has been a vivid interest in 
similar results for LTV channels; see 0 , a, 0,0 to cite 
only a few. Although most wireless communication channels 
are modeled by random LTV filters O, 0, a waterfilling 
characterization of the capacity of deterministic LTV channels 
might also be of interest. Furthermore, many nonstationary 
continuous-time sources can be described as the response of 
an LTV filter to white Gaussian noise. It is therefore natural 
to ask for a solution to the dual problem, namely the reverse 
waterfilling characterization of the rate distortion function for 
such sources with a fidelity criterion. The classical answer 
to this question in the case of a stationary source, already 
outlined by Kolmogorov in 0, has been given by Berger |9] 
for a broad class of stationary random processes. Since then, 
until quite recently no], no similar results for nonstationary 
sources have been reported. Within the framework of time- 
frequency analysis, treating the time-frequency plane “as a 
whole” HD, we present waterfilling solutions to both problems 
(with constraints on the average energy in the case of the 
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channel and on the squared-error distortion in the case of the 
source). 

We consider integral operators P from the Hilbert space 
L^(R) of square-integrable functions / : R —C U { 00 } into 
itself of the form 

/ OO 

( 1 ) 

-00 

with the kernel h £ L^(R^), i.e., Hilbert-Schmidt (HS) 
operators on (R) ED- Every such operator has a unique 
Weyl symbol p = ap G L^(R^) so that Eq. ([TJ may be written 

as 113, na 

The Weyl symbol, a concept originating in quantum mechanics 
M, HD, ca, is now a standard tool for the description of 
LTV systems m (because of its physical provenance, we 
shall often switch between variables t, uj and x, ^ standing 
for time, angular frequency and the corresponding phase 
space coordinates). The operator O, regarded as an LTV 
filter for finite-energy signals /(f), will play a central role 
in our investigations. However, for the formulation of prob¬ 
lems it will be necessary to replace P with the operator 
Pr : L^(R) -A L^(R) having the spread Weyl symbol 
ap^{t,uj) = pr{t,uj) = p{tfr,ujlr), where r > 1 is the 
spreading factor. Eq. ([TJ then turns into 

/ OO 

h{r,t,t')f{t')dt', (3) 

-OO 

where £ L^(R^) denotes the kernel, now depending 

on r. It is not difficult to express h{r, t, t') in terms of h{t, t') 
and r; however, we shall rarely make use of that representation 
since the Weyl symbol appears to be the appropriate filter 
description in our context. Although other choices are pos¬ 
sible for that symbol (also called the time-frequency transfer 
function; see m] for a systematic overview), the Weyl symbol 
excels due to some unique properties, one of them being most 
helpful later on. There is one other choice for the description 
of LTV filters: the spreading function 0, ITSll . [19]. This is 
the two-dimensional (symplectic) Fourier transform of, in our 
case, the Weyl symbol ap, 

^ JJ ^e~'*^^''~^^'>ap{x,^)dxd^, 

and its popularity in mobile communications comes from the 
fact that the representation 
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allows a simple interpretation of the operator in terms of a 
weighted superposition of time delays r and Doppler shifts 
V of the input signal. Because of ap^ir^v) = r^apijT^rv') 
we observe increasing concentration of the spreading function 
ap^ of operator around the origin of the t , z 2 -plane as 
r ^ oo. This behaviour, shared by many practical LTV 
filters and termed underspread in 1201, HD, is therefore also 
peculiar to our setting (where, in principle, r tends to infinity). 
However, it remains to be remarked that the spreading function 
would not be the proper means for formulating the subsequent 
waterfilling theorems. Theorem |2] and Theorem |3] 

The present paper evolves from previous work presented 
in o. We now give a brief overview of the contributions 
of our paper with emphasis on extensions and modifications 
compared to lfT2l : for details, refer to the text. The LTV filters, 
initially arbitrary HS operators, are later restricted to those 
having Weyl symbols in the Schwartz space of rapidly decreas¬ 
ing functions (thus including the bivariate Gaussian function 
used in IfT^ L The waterfilling theorem for the capacity of 
the LTV channel is now stated in terms of the reciprocal 
squared modulus of the spread Weyl symbol of the LTV 
filter. Similarly, the reverse waterfilling theorem for the rate 
distortion function for the nonstationary source is stated in 
terms of the squared modulus of the spread Weyl symbol of 
the LTV filter. A major difference from EH is the statement 
of a new Szego theorem, which is now general enough to 
cover a large class of operators. For part of the proof of the 
Szego theorem we resort to a powerful asymptotic expansion 
having its roots in semiclassical physics m, ini, ED. Since 
our results are asymptotic in nature, there is a need to give a 
lower bound for the spreading factor so that the formulas in 
the waterfilling theorems yield useful approximations. A lower 
bound is suggested by means of the Robertson-Schrodinger 
uncertainty inequality M- Several concrete examples will 
illustrate our results. 

IT Mathematical Preliminaries 

In the present section, we fix the notation and compile 
some mathematical concepts and results associated with the 
LTV filter Q. In Section Hl-BI it will be sufficient to restrict 
ourselves to the spreading factor r = 1, therefore it is omitted; 
generalizations to the case r > 1, mostly obvious, will be 
addressed as needed in the subsequent sections. 

A. Notation 

The following notations will be adopted; The inner product 
in L2(R) is denoted by (/i,/ 2 ) = fi{x)f 2 {x) dx, and 
ll/ll = is ths corresponding norm. For an operator 

A : L\R) -)■ its adjoint A* : -)■ L^{R) is 

defined by the condition (A/ 1 ,/ 2 ) = (/i, A*/ 2 ) V/i, /2 S 
L^(K); A is called self-adjoint if A* = A. ), n = 1,2, 

is the Schwartz space of rapidly decreasing functions on R." 
(cf. my, if n = 2 and the function u additionally depends 
on the parameter r, u = u{r, x, /), then u G K.^) means 

< Cc/s < 00 


for all a = {ai,a2),f3 = (/3i,/32) S Nq, where the constants 
Cap do not depend on r. Lk(R) is the real Hilbert space of 
real-valued functions in L^(R). 


B. Fundamental Concepts and Results 

1) Weyl correspondence: The Weyl symbol ap of the HS 
operator P in ([TJ is given by the equation (sometimes called 
the Wigner transform) 03, m 

= j y,x-dx'. (4) 

The linear mapping P i-p- p = ap defined by (IH establishes a 
one-to-one correspondence between all HS operators on L^(R.) 
and all functions p G L2(M2) |[T41, im. Moreover, it holds 
(here and hereafter, double integrals extend over R^) 

^// =// \Hx,y)\'^ dxdy. (5) 

The above mapping (or rather its inverse) is called Weyl 
correspondence M- 

2) Singular value decomposition (SVD): Every HS operator 
P on L^(R) is compact and so is its adjoint P* ini. Define 
the self-adjoint operator A = P*P on L^(R). A is positive 
because {Af,f) = {Pf,Pf) > OV/ G L^(R), and compact 
since one factor, say, P, is compact. Therefore, P has the 
SVD ini, 121 Th. 8.4.1] 

N 

{Pf) (x) = ^ (/, fk)gk {x ), (6) 

k^O 

where {/o,..., /at}, {go, ■ ■ ■, Pa} (N gNq or N = 00 ) form 
orthonormal systems in L^(R), and Aq > Ai > ... > 0 are 
the non-zero eigenvalues of A (counting multiplicity) with the 
corresponding eigenfunctions fk', the functions gj. are defined 
by gk = Pfk/s/y^, the positive numbers s/Xk, k = 0,..., N, 
being the non-zero singular values of P. If P maps Lk(R) 
into itself, then the functions fk , gk will be real-valued. With¬ 
out loss of generality (w.l.o.g.) we shall assume that N = 00 
(otherwise, put Xk = 0 and choose fk, gk anyway for k > N). 
Then always A/c —>■ 0 as /c —>■ 00 . 

3) Traces of operators: By Eq. the kernel of operator 

P in (Uli has the form h{x, y) = V^gk{x)fk{y) from 

where we readily obtain fj \h{x, y)\'^ dx dy — 
combination with 15]), this results in the useful equation 


tr A = A 

k=0 



|p(x,^)pdxd/< 00 . (7) 


Since tr A (the trace of A) is finite, A is of trace class (see 
113 for a general definition of trace class operators). 

In Section |Vl| the operator A = PP* will be considered. 
Plugging P* f G L^(R) for / G L^(R) in l6|) we get for A the 
representation (A/)(x) = f K^{x,y)f{y) dy with the kernel 


y) = X! ^k9k{x)gk{y)- (8) 

k=0 


A has the same eigenvalues as A. Eurthermore, since we are 
dealing with the Weyl symbol we have the simple rule 


ap.{x,^) = ap{x,f). 


(9) 
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Hence, Eq. O holds by analogy for operator A (just replace 
“A” with “A”). 

In quantum mechanics, an operator on L^(K.) is called a 
density operator, if it is 1) self-adjoint, 2) positive and 3) of 
trace class with trace one El. Apparently, the above operators 
A, A enjoy all these properties, with the exception of the very 
last. We give them a name: 

Definition 1: A quasi density operator (QDO) is an operator 
on L^(R) of the form P*P or PP*, where P : L^(R) — >• 
L^(R) is an HS operator. 

Remark 1: In Il22l it is noted that any self-adjoint, positive 
operator on L^(R) of trace class allows factorizations as given 
in Def. [T] the above narrow-sense meaning of QDO will be 
sufficient for our purposes. 

The following result is key to our paper: If the operator 
B : L^(IR) —L^(R) has a Weyl symbol as G =5^(R^), then 
B is of trace class and its trace is given by the trace rule EH 

ti B = JJ aBix,^)dxdC (10) 

Refer to El concerning the smoothness assumption and for 
a proof. 

4) Bound on eigenvalues: If the function a = a(a:,^) : 
C is differentiable up to the sixth order and it holds 

sup |i9“^9?^a(a;,^)| < Cct < cxD (11) 

for all a = {ai,a 2 ) G I = {0,1,2,3}^, then the operator 
A defined by the Weyl symbol o is a bounded operator from 
L^(R) into itself, and it holds 

ll^/ll < coCll/ll,/ei2(R), 

where C = ^ certain constant not 

depending on the operator. This is the famous theorem of 
C alderon-Vaill anc ourt El, ini. Consequently, the absolute 
value |A| of every eigenvalue A of A is bounded by cqC. 

III. Channel Model and Discretization 

We consider for any spreading factor r > 1 held constant 
the LTV channel 

g{t) = {Prf){t) +n(t), -oo <t < oo, (12) 

where P^ is the LTV filter the real-valued filter input 
signals f{t) are of finite energy and the noise signals n{t) 
at the filter output are realizations of white Gaussian noise 
with two-sided power spectral density (PSD) Nq /2 = 0^ > 0. 
Moreover, we assume throughout that the kernel of 

operator P in (d is real-valued; observe that due to 

h{r,t,t') = rh{{r~^{t + t') -f r(f - t'))/2, 

then also the kernel h{r,t,t') of operator Pr will be real¬ 
valued so that Pr maps Tr(R) into itself. This channel is 
depicted in Lig. [T] 

We now reduce the LTV channel (fT2t to a (discrete) vector 
Gaussian channel, following the approach in El for LTI 
channels; our analysis is greatly simplified by the restriction to 


white Gaussian 
Tl(t) noise with PSD 
No/2 = 02 


fit) 


finite-energy, 

real-valued 


Prit,U}) 


LTV filter 



Ht) 


Fig. 1. Model of the LTV channel. The Weyl symbol pr{t,io) acts as a 
time-frequency transfer function; r > 1 is the spreading factor. 


n{t) - 

white Gaussian noise 
with PSD No /2 = 0-2 


Prit,Uj) 


LTV filter 


x{t) 

response 


Fig. 2. Model of the nonstationary source 


finite-energy input signals. Lor the SVD of operator Pr the r- 
dependent operator A{r) = PrPr has to be considered; since 
eigenvalues Xk and (eigen-)functions fk, Qk in the SVD now 
also depend on r, this will be indicated by a superscript • 
Then, by Eq. (©, the LTV filter © has the SVD 

OO 

{Prf)it) = Y^[>^k^]^akglf\t), (13) 

where the coefficients are Ok = (/, k = 0, 1 ,..., and 

{g^^; fc = 0,1,...} forms an orthonormal system in L^(R). 
Recall from Section III-B2I that the functions fj^^\ g^^^ are 
real-valued. The perturbed filter output signal g — Prf, 
g{t) = g{t)+n{t), is passed through a bank of matched filters 
with impulse responses hk{t) = glf\—t), fc = 0,1,... . The 
matched filter output signals are sampled at time zero to yield 
{git),hk{-t)) = bk + Uk, where bk = (git), hki-t)) = 
[X^^'^Y^'^ak, and the detection errors Uk = {n(t), hki—t)) are 
realizations of independent identically distributed (i.i.d.) zero- 
mean Gaussian random variables Nk with the variance 9^, 
Nk ^ A/'(0, 9^). Lrom the detected values bk = bk + rik we 
get the estimates dk = [xY'^]~^t'^bk = Uk + Zk for the coef¬ 
ficients Qk of the input signal /, where Zk are realizations of 
independent Gaussian random variables Zk ^ M{d, 9"^/X^^Y■ 
Thus, we are led to the infinite-dimensional vector Gaussian 
channel 

n = + fc = 0,l,..., (14) 

where the noise Zk is distributed as described. Note that the 
noise PSD 0^, measured in watts/Hz, also has the physical 
dimension of an energy. 

IV. A SzEGO Theorem for Quasi Density Operators 
Lrom now on to the end of the paper, we assume that the 
Weyl symbol p of the HS operator P in © is in the Schwartz 
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space of rapidly decreasing functions, p G oS^(]R^). 

Consider the QDO A = P*P and generalize it as above 
to the operator A{r) = P*Pr, r > 1 (being again a QDO). 
We now state and prove a Szego theorem for A{r). Szego 
theorems like the subsequent Theorem [1] are not new Il25l . 
J23], (27), I26l, but all the Szego theorems we are aware 
of are inadequate for our purposes. The proof of Lemma |2] 
(see below) rests on an asymptotic expansion of the nth 
power of A{r). Asymptotic expansions such as that (there are 
different kinds of estimating the error!) have a long tradition 
in semiclassical physics and the theory of pseudodifferential 
operators Ga, ini; rigorous proofs, however, are sometimes 
hard to hnd. A complete proof of the following Lemma [T] 
which is perhaps closest to results of ED, is shifted to the 
Appendix. Although we need the lemma only in the case of 
m = 1, it would not be natural to omit a full statement of it; 

Lemma 1: For any n S N, the Weyl symbol of the operator 
A^(r) = [A(r)]"’, r > 1, has the asymptotic expansion 

OO 

aA«{r){x,0 (15) 

k=0 

where ao(x,^) = ak S =5^(R^) else, and Eq. (fTSl l 

means that for all m S N it holds 


= ^r 2'=afe(a;/r,^/r) 

+ x/r, ^/r), 




where = R^{r,x,^) G 

Proof: See Appendix. ■ 

Asymptotically, i.e., as r —>■ oo, ao{x/r,^/r) is the dominant 
part of the asymptotic expansion (fTsT l. As customary in the 
theory of pseudodifferential operators (cf., e.g., El, El), 
the expression \pr{x, will be called the principal symbol 
of operator A^{r). Observe that the Weyl symbol of the 
nth power of the operator A{r) = Pj.P*,r > 1, has an 
asymptotic expansion analogous to that of A^{r) and the 
principal symbols of both operators are identical. 

Definition 2: For any two functions A, i? : [1, oo) —M the 
notation A = B means 


lim 

x^oo X'^ 


= 0 , 


or, equivalently, A{x) = B{x) + o(x^) as x —)> c», where o(-) 
denotes the standard Landau little-o symbol. 

In our context, x will always be the spreading factor r > 1. 
Thus A = B implies that A(r)/r^ = B{r)/r'^ + e where 
e — >■ 0 as r — >■ OO. 

Lemma 2: For any polynomial Gn{x, z) = Yl,n=i <^n{x)z^ 
with bounded variable coefficients c„(x) G M, x > 1, it holds 


E' 


GN{r,Xk^) = ^ JJ ^GN{r, |pr(a;, OP) dx d^. 


Proof: First, application of operator P* to both sides of 
Eq. (fn yields 






So we get for any / G L^(IR) the expansion 

OO 

G^(r, A{r))f = G^(r, A«)(/, /f 

k=0 

Hence, operator B{r) = GN{r, A{r)) is of trace class with 
the trace 

OO 

trP(r) = y^G7v(r, (16) 

k=0 

the series being absolutely converging since Gn{x,0) = 
OVx G [1, oo). 

Second, we use the trace rule (fTol l to obtain 

trP(r) = ^ JJ aB(r){x,C)dxd^, (17) 

where Weyl symbol of operator B{r). By 

linearity of the Weyl correspondence, has the 

expansion 

N 

(XB(r){x,Cj = y^c„(r)(T^,.(^)(x,C). (18) 

n—1 

From Lemma [T] taking m = 1, we infer that 

JJ crA"(r)(a;,Odxd^ = yy dxd^. 

Plugging ([TSll into Ell, we obtain by means of the latter 
equation 

1 ^ 

trP(r) = / / CTA"(r)(a^,C) dxd^ 

= J-^Cn{r) ff |pr(x,C)P”dxdf 

= yy G7v(r, |pr(x,0|^)dxdQ (19) 

Eq. (fT^ in combination with Eq. ( fThl l concludes the proof. ■ 
Lemma [T] shows that in the case of n = 1 and, say, m = 1, 
the Weyl symbol a(r,x,^) = o-A(r){x,0 of operator A{r) 
satishes Ineq. EJ of Section III-B4I with upper bounds Ga 
that may be chosen independent of r > 1. Consequently, the 
eigenvalues of A{r) are uniformly bounded for r > 1; 
dehne 

Ap = max< supAg’'\max|p(x,^)|^ > . (20) 

L r>l J 

This constant appears in the next theorem: 

Theorem 1 (Szego Theorem): Let p : [0,A] —^ R, A G 
(0,oo), be a continuous function such that lima;_>o+( 7 (x)/x 
exists. For any functions a, b : [l,oo) —R, where a(x) is 
bounded and Apb(x) G [0, A], define the function G{x,z) = 
a(x)g(b(x)z), (x,z) G [l,oo) x [0,Ap]. Then it holds 

// br(a;,6ndxd^. (21) 

Proof: The function /(x) = g{x)/x, x G (0,A], has a 
continuous extension F{x) onto the compact interval [0, A]. 
By virtue of the Weierstrass approximation theorem, for any 
m G N there exists a polynomial Fm^-i(x) of some degree 
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Nm — 1 such that \F{x) — Fn^-i{x)\ < em = ^ for all x G 
[0, A]. Consequently, the polynomial gNmi^) = 
of degree satisfies the inequality 

|ff(a;) - 5Ar,„(a;)| < G [0, A], (22) 

Define the polynomial with variable coefficients 

Gn^{x,z) = a{x) gMjyh{x)z). We now show that 

oo oo 

(a ^ r-^'^G{r, ) (23) 

fc—0 k—0 

and 

^ II \Pr{x,C)\'^)dxd^ 

^// G{r,\prix,^)\‘^)dxd^ (24) 

as m ^ oo, uniformly for all r > 1 . To this end, first observe 
that by Eq. (I?]) (generalized to the operator Pr,r > 1) it holds 

i ff\Prix,0\^dxd^ 

k^O 

= CpT , 

where Cp = (27r)“^ ff |p(x,^)p dxd^ is a finite constant. 

Proof of (Ed]).' By Ineq. (l22l) we get (precluding the trivial 
case Ap = 0) 


V. Waterfilling Theorem for the Capacity of 
Linear Time-Varying Channels 


A. Waterfilling in the Time-Frequency Plane 

The function Nr,r > 1, occurring in the next theorem is 
defined by Nr(t,uj) = Ni{t/r,uj/r) where 

(f, w) = \p{t, w) I ( 26 ) 

p = ap being the Weyl symbol of operator P . Recall that 
p G 0 {-) denotes the standard Landau big-O symbol 

and x ~^ denotes the positive part of x G M, a;’*' = max{0, x}. 

Theorem 2: Assume that the average energy S of the input 
signal depends on r such that S{r) = O(r^) as r —^ c». Then 
for the capacity (in nats per transmission) of the LTV channel 
(021 it holds 





{v - Nr(t,w))+ \ 
Nr{t,Uj) ) 


where v is chosen so that 


df dw, 


(27) 


S = 



Nr{t,uj))^ dfdo;. 


(28) 


Proof: The first part of the proof is accomplished by 
waterfilling on the noise variances [|2] Th. 7.5.1]. Let v]: = 
9^9^/0 = oo), /c = 0,1,..., be the noise variance 
in the fcth subchannel of the discretized LTV channel (fT4l) . We 
exclude the trivial case 5” = 0. The “water level” is then 
uniquely determined by the condition 


OO OO 

oo 

oo 

< Me^iA/Ap)Y,^i"\ 

/c=0 


oo K — 1 

s = ^ ~ (29) 

k=0 k=0 

where K = max{/c G N; < cr^} is the number of 

subchannels in the resulting finite-dimensional vector Gaussian 
channel. The capacity C of that vector channel is achieved 
when the components Xk of the input vector (Aq, ..., Xk-i) 
are independent random variables ~ Af(0, — i/^); then 


where M = sup 2 ,>i |a(x)| < oo. Since “ Cpr^, 

after division of the inequality by r^, convergence in (|2^ 
follows as claimed. 

Proof of (|2?1).' Similarly, 

I JJ G{r, \pr{x,fi)\'^)dxd^ 

~ II \Pr{x, 0 \^)dxd^\ 

< M€m{A/Ap) JJ \pr{x,^)f dxdC 

Since {2Tr)~^ JJ \pr{x,^)\^ dx d^ = CpV^, after division by 
27rr^ we come to the same conclusion as before. 

Finally, choose a (large) number m G N, so that the left- 
hand sides in ( |2^ , (l24l i become arbitrarily close to their 
respective limits. Replace function G in Eq. (ItTI i with the poly¬ 
nomial Gn,^- Then, by Lemma|2]and the uniform convergence 
in (|2^ . (l24l) the theorem follows. ■ 

Note that Theorem [T] applies to operator A{r) without any 
changes. 


kC — 1 ^ / 2 2 \ 

G = ^ - In f 1 -f — — 2 "^ j nats. (30) 
fc=0 ^ ^k J 

In the second part of the proof we apply the above Szego 
theorem. Theorem [T] To start with, note that is dependent 
on r and that always = a^{r) > 0. Additionally, suppose 
for the time being that the function (T^(r) is finitely upper 
bounded as r ^ oo. Define 

, fmax|0,lnx| if x > 0, 

= | 0 ifx = 0. 

By Eq. (EOjl we now have 



OO 


where a{r) = 1, b{r) = a'^{r)l9'^, g{x) = | ln+ x, x G [0, A], 
and A is chosen so that Apb{r) < A < oo when r is large 
enough, Ap being the constant (l20l i. This choice is possible 
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since remains bounded as r —>■ oo; w.l.o.g., we assume 

Apb{r) G [0, A] for all r > 1 . Then, by Theorem[T]it follows 
that C = C{r) satisfies 



1 

27r 


-\n+ 
2 + 


02 


\Pr{x,0\'^ da;d^ 


-\n 

2 


1 + 


Nr(t,i0) 


dfdw, (32) 


where Nj.it,w) = |- \pj.{t,uj)\ Next, rewrite Eq. 


2 ir 

OO 


as 


S = ^a^{r) 1 - 


1 


fe =0 


\ N) 

02 , 


Put a{r) = (j‘^(r), b{r) = a'^{r)l9‘^ and define 

it + 


= / (1 - t) if a; > 0, 

\ 0 ifz = 0. 


Again, w.l.o.g., we may assume that a(r) is bounded and 
Apb{r) G [0, A] for all r > 1 where A is chosen as above. 
Then, by Theorem [T] it follows that 



1 - 


\Pr{x,0\‘^ 


Nr{t,uj) I dfdw. 


+ 

dx d^ 


(33) 


2 / N 

Finally, replacement of in Eqs. (El, E3 by parameter 
1 / yields Eqs. dZTl l. (l28T l. 

We complete the proof by a bootstrap argument; Take 
Eq. (|28]) as a true equation and use it for the definition of 
(T^(= 27ri/); after a substitution we obtain 



dfdw = S{r)/r'^. 


Because of the growth condition imposed on S, ly = v{r) 
stays below a finite upper bound as r —oo, and so does 
(T^(r). Consequently, the previous argument applies and the 
capacity C is given by Eq. dZTl i. Second, by reason of 
Theorem[T] it holds for the actual average input energy S'act (f) 
= *^act = S. Thus, the dotted 

equation (l28l l applies anyway—even when S is taken as S'act- 


From the property p G it is easily deduced that, say, 

Ni(t,uj') ^ Cl (t^ + cu^), it,u)') G 

where ci is some positive constant depending on p; therefore, 
condition (l28l l certainly makes sense. 

Note that the use of Landau symbols in Theorem 1 does 
not mean that we need to pass to the limit (here, as r — 
c»). Rather, the dotted equations dZTl l. (l28l l may give useful 
approximations even when r is finite (but large enough). 

Example 1: Consider the HS operator P on L2(]R) with the 
bivariate Gaussian function 

1 / — 2.2 , 2 2 \ 

p(Ca;)=e-2(7 t +i ^ ^ (34) 



Fig. 3. Exact values and waterfilling approximation of the capacity of the 
LTV channel of Example ^ 


7 > 0 fixed, as the Weyl symbol. Then Pr^v > 1, has the 
Weyl symbol = exp[—( 7 “ 2 i 2 _|_ ..y2^2)^(2r'2)]. Pj. is 

related to the operator of the so-called heat channel ifT^ 
by the equation Pj. = cP^^\ where 5 = 2 arccoth(2r2) > 0 
and c = cosh((5/2). P^g^ has the diagonalization ifTTI . Il28ll . 

Qa 

OO 

where p = and fk{t) = {D^Hk){t) = 
is the dilated A:th Hermite function the real-valued 

eigenfunctions fk, k = 0,1 ,..., form an orthonormal sys¬ 
tem in Therefore, A{r) = P*Pj. = P^ has the 

eigenvalues = 0,1,..., so that the LTV 

channel (fTSli reduces to the discrete vector channel (fT4l) 
where the noise random variables Zk ~ Af(0, have the 
variances = {0jc)^. Take the average input energy 
S{r) = 27 rr 202 SNR, where SNR > 0 is the signal-to-noise 
ratio ( 27 rr 202 having the interpretation of the average energy 
of the relevant noise). In Fig. [2 capacity values labeled “exact” 
have been computed numerically by waterfilling on the noise 
variances, as given in the proof of Theorem ID Note that the 
results do not depend on 02 

From Theorem m after computation of the double integrals 
and elimination of parameter v we get the equation 

r 2 2 

y [Wo(( 47 rSNR-l)/e) + l]", (35) 

where Wq is the principal branch of the Lambert W function 
determined by the conditions W(x) exp[W(x)] = x for all 
X e [-e-\oo) and W(0) = 0 ED, EQI. In Fig. |3] the 
approximate capacity dTSl l is plotted as a function of r (labeled 
“waterfilling”). Surprisingly, the approximation is good even 
for spreading factors close to one. 

B. Operational Meaning of the Capacity Result 

Theorem 12] gives the information capacity (in the sense of 
ED) of the LTV channel (fT^ . To provide this result with 




















7 


an operational meaning, we need to construct a code in the 
form of a set of continuous-time signals which achieves a 
rate arbitrarily close to this capacity along with constructive 
methods of encoding and decoding. 

We use the notation in the proof of Theorem |2] For 
any fixed average input energy S > 0 and any spreading 
factor r > 1 held constant, the construction will be based 

(r) 

on the eigenfunctions f^,k = 0 ,..., iT — 1 , of operator 
A{r) — PrPr, where K is as in Eq. (|2^ : at the receiver, 
the corresponding functions '''ill 

(r) (r) 

used. Since the functions ^ are in the range of the 

operators Pr,P* with Weyl symbols Pr,pr G resp., 

these functions are rapidely decreasing, gl[\ G In 

practice, any finite collection of functions ui,, un G 
may be regarded to be concentrated on a common bounded 
interval centered at the origin and to be almost zero out¬ 
side. Thus, for the sake of simplicity, we shall assume that 
fk\^) — 9k\^) = O’ fc = 0,..., K—1, if \t\ > d/2 for some 
d G (0, oo); d will have the meaning of a delay later on. It will 
be convenient to switch from natural logarithms to logarithms 
to the base 2 and so from nats to bits. Then, the (information) 
capacity Ck of the fcth subchannel, fc = 0 ,..., df — 1 , figuring 
in the sum on the right-hand side of Eq. (l30l l. reads 

1^/0 = ^ log 2 l^lls. 

We treat the K subchannels as independent Gaussian channels 
with the noise variance v/. each and follow the classical 
approach of Shannon in, ED: Eor the fcth subchannel, for 
any rate Rk with 0 < Rk < Ck and any e > 0 generate 
a codebook {afc(m) = (afco(m),..., afc,Lj,_i(m)); m = 
l,2,...,Mfe = C with the property that 1) 

akiim),l = 0,..., Lk — 1, are realizations of i.i.d. random 
variables ^ — v^) and 2 ) the probability of a 

maximum likelihood decoding error is smaller than e for every 
transmitted codeword ak{m),m = 1,2, ...,Mk. We may 
assume that Lq = ... = Lx-i = L. Eor every message m = 
{mo,...,mK-i) G {1,2,..., Mo} x ... x (1, 2,..., M^.ij 
form the pulses 

K-l 

ui{m,t-ld) = ^ aki{mk)fk\t- Id), l = 0,...,L-l, 

k=0 

and take the pulse train 

L-l 

u(m,t) = ui(m,t — Id) (36) 

1=0 

as input signal to the physical channel. During transmission 
over that channel, each pulse ui{m,t — Id) undergoes a 
distortion modeled by the LTV filter Q, and results in the 
deformed pulse 

K-l 

Id) = ^ aki{mk)gl[\t - Id). 

k^Q 

Thus, the output signal of the physical channel is 

L-l 

y{m,t) = E vi{m, t — Id) + n{t), 

1=0 


where n{t) is a realization of white Gaussian noise as in the 
LTV channel model (fTSl i. Eor any of the K subchannels, pass 
the signal y{m,t) through the matched filter with impulse 
response hk{t) as given in Section Hill sample the matched 
filter output signal at time Id, I = Q,...,L — 1. Since 
y{m,t) = vi{m,t — Id) + n{t) if \t — ld\ < d/2, we 
again obtain estimates dki{mk) = aki{mk) + Zki for aki{mk), 
where Zki are realizations of independent Gaussian random 
variables ^ J\f{Q,v/.). Maximum likelihood decoding of the 
perturbed codeword dk{mk) = {dkoimk), ■.., dk,L-i{mk)) 
yields the correct codeword ak{mk) (thus, ruk) with a prob¬ 
ability of error smaller than e. At the transmitter, choose 
the message m at random such that each component ink 
has probability M//^ and is independent of the other compo¬ 
nents; convey m through a pulse train as described. Then— 
treating each of the K subchannels separately—the total rate 
= T [RkL) (in bits per pulse) is attained with a 

total probability of a decoding error smaller than Ke. When 
L ^ oo, Shannon’s theory m ensures that e can be made as 
small as we wish. Moreover, i?tot R ^ Rq + ... + Rk-i 
and, by the law of large numbers, the average input energy 
^EtoEk^oaUruk) tends to ^ 

with probability 1. Einally, since the rate R may be chosen 
arbitrarily close to the capacity C = Cq -I- ... -f Ck-i (at 
the expense of a larger length L of the pulse train), the 
construction of the desired coding system is complete. 

Example 2: Consider the LTV channel (fT2l) with the op¬ 
erator Pr = cPg'^'^ of Example [D The eigenfunctions of 
operator A(r) = P/Pr are the functions f][\t) — fkit) = 
{DjHk){t), k = 0 , 1 ,..., (here, not depending on r); the 
functions gl/^ in the SVD (foT l of Pr coincide with 
for all fc. Now, choose specifically r = 2, 7 = 1/10 and 
take the average input energy S = 27rr^0^ SNR (as generally 
assumed in Example [D with SNR = 100 and noise PSD 
No/2 = 6^ — 0.01 (unit omitted). Waterfilling on the noise 
variances = {9/c)'^{p = e~^), k = 0 , 1 ,..., as 
given in the proof of Theorem|2] yields the number of AT = 11 
subchannels. In Eig.|4}a), the first K eigenfunctions fj/~\t) = 
{DjHk){t), k = 0,..., K — 1, are displayed. The portion of 
an input pulse train plotted in Eig. |4}b) has been computed 
according to Eq. (l36T l with the delay parameter d = 6a, 
a = \/2r^, by numerical simulation of the involved random 
variables. Observe that there is no appreciable overlap of 
individual pulses. Each pulse transmits 22.6 bits (=15.7 nats, 
cf. Eig. O of information arbitrarily reliably [provided that 
the length of the pulse train(s) becomes larger and larger]. 
The meaning of parameter a will be explained in Section IVIII 


C. Comparison with Classical Work 

Gallager’s theorem ID Th. 8.5.1] gives the capacity of LTI 
channels under very general assumptions. In the case of an 
LTI filter with a bounded and square-integrable frequency 
response H{uj) = J_^e~^‘^*h(t) dt (a.k.a. transfer function; 
fc 7 ^ 0 is the impulse response) and additive white Gaussian 
noise of PSD No/2 = 6^ > 0 at the filter output, Gallager’s 
theorem states that the capacity (in bits per second) is given 




Fig. 4. (a) First eleven eigenfunctions, dilated Hermite functions, for the 

LTV channel of Example [2| (b) Portion of an input pulse train (centered at 
the origin) to the physical channel and the corresp. distorted output (without 
noise) of the same example. Time t is measured in some unit of time; on the 
y-axis, also the physical dimension is omitted. 


parametrically by 
1 


C = 


s = 


27T 


1 / (^-iV(w))+ 


do; 


(v — N{uj)y dw, 


(37) 

(38) 


where z/ is the parameter, S is average input power, and 




(39) 


We observe a perfect formal analogy between the waterfilling 
formulas ( ITtI i. dMt and those in Theorem |2] Moreover, the 
functions ( l39b and (l26l l are the reciprocal squared modulus of 
the (time-frequency) transfer function of the respective filter 
times the same noise figure. 

Eqs. dZTl i. (l28l l may also be used, of course, for a parametric 
representation of the function C{S) with v as parameter. 


VI. Reverse Watereilling Theorem for Related 
Nonstationary Sources 

In the present section, we consider the nonstationary source 
formed by the nonstationary zero-mean Gaussian process 
given by the Karhunen-Loeve expansion 

OO 

(40) 


where the coefficients X^, k = 0,1,..., are independent 
random variables ~ Af{0,al) with the variances cr^ = 
ct > 0. This is the response of the LTV filter (fOl l 
to white Gaussian noise with PSD Nq/2 = cf. ||2l- This 
source is depicted in Fig. |2] 


A. Wigner-Ville Spectrum of the Source 

In the present subsection, the spreading factor r > 1 is 
initially not essential, hence set to one and not displayed. 

The Wigner-Ville spectrum (WVS) 4>(f,a;) of the nonsta¬ 
tionary random process {X{t),t S R} in (l40l l describes its 
density of (mean) energy in the time-frequency plane |[32|. 
The WVS may be regarded as the nonstationary counterpart 
to the PSD of a stationary random process. It is defined by 
means of the Wigner distribution Wx of the realizations x{t) 
of {X(f)} and then taking the expectation ^f2\ . Since x{t) is 
almost surely in L^(R), we may write 

{Wx)it, uj) = ^ J e-'‘^*'x + 0 2 ; - 0 dt'. 

The WVS 4)(f, w) = E[{WX){t,u))] of the random process 
{X(t)} is then given by 

^(i,w) = —y e [t +-,t --j dt', (41) 

where = E[X(fi)X(< 2 )] is the autocorrelation 

function. Appropriately enough, identities such as (EB are 
called a nonstationary Wiener-Khinchine theorem in ll^ . A 
computation yields 

CXD 

ff{ti,t 2 ) = cr^y^Afe5fc(ti)5fc(f2) = cr^Ar^(f 1 ,^ 2 ), 
k=0 

where is the kernel of the operator A — PP*, see Eq. (|8]l. 
By means of the Wigner transform EJl, the Weyl symbol of 
A becomes 

r°° . , ft' t'\ 

a^{t,uj) = J e K^i^t +-,t--j dt'. (42) 

Comparing Eqs. (l42l i and (HTI) we thus obtain 

2 

•cr^(f,a;). 

In the general case r > 1, the WVS depends on r and we 
shall write $(r, •, •) for it; then, the latter equation becomes 

2 

= ^A(r) ^)- (43) 

By use of the trace rule (fTOl i and Eq. (|7]i (rewritten for A and 
then generalized to r > 1) we conclude that 

JJ $(r,f,a;) dfdw = cr^ • ^ JJ w) df dw 

= ■ tv A{r) 

00 

where the last infinite sum is indeed the average energy 
E{r) — realizations x{t) of the random 

process (l40l l: Eq. (l25T l yields 

E{r) = Cpr^a"^. 


(44) 
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By means of Lemma [T] we get from (|43] ) for the WVS the 
asymptotic expansion 

2 / oo 

$(r,f,a;) - ^ I \prit,uj)\‘^ + ^r~^^dk{t/r,u}/r) 

^ \ fc=i 

where dk € The expression a;)P—call it 

principal term of the WVS $(r, f,w)—will play a prominent 
role in the next subsection. 

Remark 2: Asymptotically, the principal term might be a 
good substitute for the WVS $(r, f,a;) itself. It is not only 
similar in shape, but it also gives the same average energy 
[see (|25 T i 1 and is non-negative throughout (cf. JM)). 



B. Reverse Waterfilling in the Time-Frequency Plane 

Substitute the continuous-time Gaussian process {X{t), t G 
K} in (l40l) by the sequence of coefficient random variables 
X — Xo, Xi,.... For an estimate X — Xg, Xi,... of X we 
take the squared-error distortion D = — XkY'] 

as distortion measure. In our context, D depends on r and it 
always holds 0 < D{r) < E(r), where E(r) is as in (l44l i. 

1) Computation of the rate distortion function: In the next 
theorem, the function ^r,r > 1, is defined by = 

w/r) where 

2 

p G c^(R^) being the Weyl symbol of operator P. Recall that 



dtduj = E{r). 


The Landau symbol n(-) is defined for any two functions as 
in Def.|2]as follows: A{x) = Cl{B{x)) as a: —oo if B{x) > 0 
and Ymimix^oo A{x)/B(x) > 0. 

Theorem 3: Assume that the foregoing average distortion 
D depends on r such that D{r) = n(r^) as r —oo. Then 
the rate distortion function R = R{D) for the nonstationary 
source (l40l i is given by 


R=7r- 

27r 


R2 


Z A 


dt dec, 


(45) 


where A is chosen so that 


D = 



min {A, w)} dido;. 


(46) 


We exclude the trivial case D — E{r). Then K > \ and the 
necessary rate R — R{D) for the parallel Gaussian source 
{Xq, ... ,Xk-i) amounts to ||2] Th. 10.3.3] 

K-i ^ 2 

R= ^ 

k=0 

Now we apply the above Szego theorem, Theorem[T] Again, 
depends on r. Suppose for the time being that 6^ = 9^{r) is 
finitely upper bounded for r > 1 and positively lower bounded 
as r —oo. By Eq. ( l47b we have 


D = S^{t) min < 1 


fe =0 

CO 


(r) 

9^(r) * 




fe=0 


where a(r) = 6*^(r), b{r) = a^j9'^{r), g{x) = min{l,a:}, 
X G [0, A], and A is chosen so that Apb{r) < A < c» when 
r is large enough, Ap being the constant (l20l) . This choice is 
possible since 9^{r) is positively lower bounded as r —>■ c»; 
w.l.o.g., we assume here and hereafter that Apb{r) G [0, A] 
for all r > 1. Already, a{r) is bounded for r > 1. Then, from 
Theorem [T] we infer that 


27r 


R = Tr 


9^{r) 


^ dxdC 


9‘^{r) 

2tt 


, d>r{t,Uj) > df dw, 


(49) 


where ^ \pr{t,u;)\'^. Next, rewrite Eq. ddST l as 



where ln+ is as defined in (l3n i. Taking a(r) = 1, b{r) = 
cr^/0^(r), g{x) = i ln+cc, cc G [0, A], A chosen as before, 
by Theorem [T] it follows that 



1 

27r 




br-(a:,C)l 


4>r(f,w) 

2ir 


dtduj. 


dx d^ 


(50) 


Einally, replacement of in Eqs. (fSOl l. ( l49b by the param¬ 
eter A yields Eqs. (l45l l. (|4^ . 

Again, we complete the proof by a bootstrap argument: Take 
Eq. (|46] | as a true equation and use it for the definition of 
9^(= 27rA); after a substitution we obtain 


The rate is measured in nats per realization of the source. 

Proof: The reverse waterfilling argument for a finite 
number of independent Gaussian sources 13, El carries over 
to our situation without changes, resulting in a finite collection 
of Gaussian sources Xq^ ... ,Xk-i where K = max{fc G 
> 9^} and the “water table” 9^ is chosen as the 
smallest positive number satisfying the condition 

OO 

D = '^mm{9'^,al}. (47) 


min{A, $i(f,a;)} df dec = D{r)/r^. 

Because of the growth condition imposed on 79, A = A(r) 
stays above a positive lower bound as r —oo and so does 
9^{r). Moreover, always 0^(r) < 27rAniax may be chosen 
where A„iax — maxj ^ $i(f, cc). The rest of the argument 
follows along the same lines as in the proof of Theorem |2] ■ 
Example 3: Consider the same “Gaussian” LTV filter (op¬ 
erator) P with Pr = cP^^'^ as in Example [T] The coefficients 
Xo,Xi,... of the random process {X(f)} in (l40l i then form 
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Spreading factor r 

Fig. 5. Exact values and reverse waterfilling approximation of the rate for 
the nonstationary source of Example 0 


a sequence of independent random variables ^ A/^(0, a^) with 
the variances (cf. lEl). For any average 

energy E{r) = 2~^r^a^ of {X(f)} define the distortion by 
D{r) = _B(r)/SDR, where the signal-to-distortion ratio SDR 
is at least one. In Fig. |5] “exact” rates R have been computed 
numerically by reverse waterfilling on the signal variances, as 
given in the proof of Theorem |3] 

From the two equations in Theorem [3] we obtain by elimi¬ 
nation of parameter A the closed-form equation 

i?= ^[W_i(-l/(e-SDR)) + l]", (51) 

where W_i is the branch of the Lambert W function de¬ 
termined by the conditions W(a;) exp[W(a:)] = x for all 
X G [—e“^,0) and W(a;) —>■ —oo as x —)> 0— ll29l . ll^ . In 
Fig. |5] the approximate rate (ISTT i is plotted against r (labeled 
“reverse waterfilling”). Again, we observe a surprisingly good 
approximation even for spreading factors close to one. 

2) Comparison with classical work: In Theorem |3 Eqs. 
(l45l l. (l46l l may also be used for a parametric representation of 
the rate distortion function R{D). In parametric form, R{D) 
has been given by Berger m for a broad class of station¬ 
ary random processes. In the latter parametric interpretation, 
Eq. ( |45] | is in perfect analogy to ||9] Eq. (4.5.52)] [with the 
(principal term of) WVS instead of the PSD], likewise Eq. (l46l l 
with regard to S Eq. (4.5.51)] (apart from a factor ^). 


VII. A Lower Bound for the Spreading Eactor 


Until now there has been no indication on how large the 
spreading factor r should at least be chosen so that the 
dotted equations in the above waterfilling theorems yield 
useful approximations. The purpose of the present section is 
to identify a presumed lower bound for r. 

Eor any r > 1 define the operator A{r) : L^(R) —?> L^(R) 
by the Weyl symbol = 27rp(r, x,^), where 


p{r,x,^) 




(52) 


Then A{r) is self-adjoint, positive, of trace class with the trace 
trA(r) = ff p(r,x,^)dxd^ = 1. Thus, A{r) is a density 
operator and the Robertson-Schrodinger uncertainty inequality 
(RSUI) applies QD; it reads; Eor any density operator on 
L^(R) with a Weyl symbol of the form 2ttp{x,^), define the 
moments (for convenience, put xi = x, X 2 = 0 


(53) 


Pi= Jj Xip(xi,X 2 )dxi dX 2 , 

CTij = jj - Pj)p{xi,X 2 )dxi dX 2 

and write of = aa^ i,j = 1,2. Then it holds 


— ^12 


4 ’ 


(54) 


(55) 


S(r) = 


where h is the reduced Planck constant (which in our context 
is always set to one). 

Now replace p(x, with p(r, x, ^); since p{r, x, depends 
on r, we shall write p,i{r), cr^(r), o'ij{r) for its moments 
(El, (in. Although p{r, x, is not a true probability density 
function (PDE), since it may assume negative values, its 
covariance matrix 

fTl(r) <712 W 
<712 (r) <7|(r) 

is always positive definite (as is the covariance matrix of any 
density operator ll35l l. The operators Pr,r > 1, may also 
be viewed as time-frequency localization operators (TELOsk 
comprising in part the TELOs introduced by Daubechies caQ 
Since p(r, f,a;) is the normalized WVS $(r, f,a;) discussed in 
Section IVl-AI Icf. Eq. (l43T l1. it is natural to define the ellipse 
of concentration (EoC) of Pr as the boundary of the region 
in phase space described by the inequality 

(x-/ii(r), ^-/i 2 (r)) <4 (56) 

and having the property that the uniform distribution on it 
has the same first and second moments as the PDE at hand 
1. Since the EoC 


has the area = 7r-\/det(4E(r)), 
the RSUI can now be recast in the inequality Ac = 
det E(r) > A'k^/RFJa = 27r, or phrased in words: The 
area of the EoC of operator Pr^r > 1, is at least 2 tt. 

However, this is not a useful criterion since it holds for any 
r; to get a useful criterion, consider the (true) PDE 


Pr{x,^) = 


ff br(a;',^Opdx'df ’ 


(57) 


i.e., the normalized principal symbol of A(r) [or A{r)'\. Note 
that the denominators in ([52l i and (fSTl i coincide, 

JJ = JJ |pr(a;,0|^dxd^, 

which is a simple consequence of Eq. (|7]i (in terms of A), 
Eq. ® and a generalization to r > 1; moreover, due to 
Lemma [Hit holds that 


= \Pr{x,0f +r ^Ri(r,x/r,5/r). 


(58) 


^Actually, the operator appearing in Example 0 originates in such a 

TFLO (also called a Daubechies operator) with Gaussian weight in time and 
frequency; see (n], ED 
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The rationale is now as follows; When r is large, pr{x,^) 
will be “close to” p{r,x,^); then the RSUI (l55l l for p{r,x,^) 
may be transposed to pr{x,^), resulting in a constraint on 
7’. With this in mind, replace in (l53T l. (l54l i function p{xi,X 2 ) 
with pi(a;i,a:2) and denote the new values for pi, af, aij by 
rrii, sf, Sij, respectively. By means of Eq. (fSST l and observing 
that the common denominator in (l52l i. (l57l i evaluates to 
2TTCpr^, we then obtain pi{r) = rriir + o(l) and by this 
(^) = Plugging the latter in the RSUI (l55l l 

for p{r,x,^) finally results in the desired constraint 


> 


2y^sl4 - 


+ 0 ( 1 ). 


’12 


(59) 


Ineq. ( |59] | suggests a lower bound for the spreading factor 
r, thus providing the wanted criterion (in practice, the error 
term would be neglected). Note that asymptotically, i.e., as 
r ^ oo, Ineq. ( |59] ) (with vanishing error term) becomes a 
necessary condition. 

Example 4: Consider the HS operator P on (M) with the 
Weyl symbol p G as given in Eq. (1?^ of Example [1] 

for any fixed parameter 7 > 0. Then the Weyl symbol pr of 
operator Pr,r > 1, satisfies JJ |pr(a;, dx = irr^, so 
that the PDE (l57l l becomes 


Pr{x,0 




(60) 


An evaluation of the integrals in ( l57t . ( l54l l yields rrii = m 2 = 
0, si = 7^/2, S2 = 7~^/2 and S12 = S21 = 0. Consequently, 
Ineq. ( |59] | turns into 


> 1 + 0(1), 


which, neglecting the error term, means no restriction at all. 
In fact, in Eig. |3] and Eig. |5] the approximation is already 
acceptable for spreading factors close to one. 

Einally, we add the explanation of the parameter a occurring 
in Example |2] of Section IV-BI To this end, we determine the 
EoC (im of the above operator Pr, r > 1, by the use of the 
identity Pr = cP'f'^ (see Example [U. The Weyl symbol of 
the operator Pg'^'^o = P^)^^ is given in closed form in 

ca. By this means, Eq. (l52]) readily becomes 


P{r,x,^) 


1 / 


where a = 7\/cothi5, /3 = 7“^-\/coth J. The exact EoC of 
the operator Pr is therefore the ellipse in phase space with 
the semi-axes Ox = \f2a, bx = and the equation 


xVal + e/bl = l. 


Erom the PDE dfiOl) . we obtain asymptotically, i.e., as r —c», 
the approximate EoC with semi-axes a = b = \/2r /7- 

Eor instance, in the case of r = 2,7 = 1/10 we find the rather 
good approximations a = 0.2828, b = 28.28 (units omitted) 
of the exact values Ox = 0.2850, fix = 28.50 (which is 
somewhat surprising since r = 2 is still small). In Example |2] 
the foregoing value of a has been used as an estimate of the 
effective half duration of a pulse. 


VIII. Conclusion 

Waterfilling theorems in the time-frequency plane for the 
capacity of an LTV channel with an average energy constraint 
and the rate distortion function for a related nonstationary 
source with a squared-error distortion constraint have been 
stated and rigorous proofs have been given. The waterfilling 
theorem for the LTV channel has been formulated in terms of 
the reciprocal squared modulus of the spread Weyl symbol of 
the LTV filter (times a noise figure), whereas in the reverse 
waterfilling theorem for the nonstationary source simply the 
squared modulus of the spread Weyl symbol (times a signal 
figure) has been used. The latter expression has been related 
to the WVS of the nonstationary source and recognized as 
its principal term. The LTV filter, initially an arbitrary HS 
operator, was later restricted to an operator with a Weyl symbol 
in the Schwartz space of rapidly decreasing functions. This 
smoothness assumption was a prerequisite for a Szego theorem 
upon which the proofs of both waterfilling theorems rested in 
an essential way. A self-contained proof of the Szego theorem 
has been given. The formulas in the waterfilling theorems 
depend on the spreading factor and are asymptotic in nature. 
Two examples with a bivariate Gaussian function as the Weyl 
symbol showed that the waterfilling theorems may perform 
well even when the spreading factor is close to one. Eor the 
general case, based on an uncertainty inequality, a lower bound 
for the spreading factor has been suggested. 

Appendix 
Proof of Lemma[I] 

In this appendix, we shall write x = {xi,X 2 ) etc. for 
phase space points (x,^) G and dtc = da;ida;2 etc. 
for the corresponding differential. Also, we use the notations 
(x) = (1 +xl + Oa; = 1 — 9x1 “ ^^2 = 

= x^^x^^ with the multi-indices a = (01,02), 
[3 = (/3i,/32) G Nq. The following proof draws on iflTll . Il26ll . 

Eor any two operators P, Q : L^(]R) —L^(]R) with the 
Weyl symbols p, q G resp., the Weyl symbol of the 

product PQ, denoted hy p^q, is given by El 

= 4/ / p{z + x)q{z + y)e^'^‘^‘^^^^’y^dxAy, (61) 

where det(a;;y) = Xiy 2 — X 2 yi- Since we need to com¬ 
pute the Weyl symbol pr^qr,r > 1, we change to the 
more convenient operation defined by {pr^ qr){z) = 

{p^rq){z/r). A computation yields (see El and 1^ in the 
case of r = 1) 

m —1 

{ p # rq ){ z ) = ^ r ~'^'‘ ak { z )+ r ~'^^ Rm { r , z ) 
k^O 

with the functions Ofe G y{R^) given by 0^(2) = Fk(z,z), 
and 

R,^{r,z)=m f f 

Jo I ’’’ 7r2 Jr2 

■ Fm{z + tx/r,z + y/r)dxdy\dt, (62) 
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where 

'k 

Fk{x,y) = - dx 2 dyi)’' [p{x)q{y)] (63) 

for k = 0,... ,171. Note that 00 ( 2 :) = p{z)q{z). 

First, we show that when a G b G 

then c = aij^rb G o5^(',K^), r > 1. To this end, note 
that for any positive integers L, M it holds Q 2 idet(x-y) _ 
and By 

partial integration we then obtain from dMl l the representation 


c(r, z) 



^2idet(x-.v)y,L air,Z + x/r) 

(2a.)2M 


Oy[b{z + y/r)] 


(2y) 


2L 


dtc dy. (64) 


Concerning the computation of {>^(’) occurring in (l64l i. note 
that for any 7 = (71,72 ) G Nq, 0 < 71 + 72 < 2L, it holds 


\dZ[{2x)-^^]\ < C{2 x)-^^,xG 


where C = C{M, L) < 00 . Consequently, for any cx,P G Ng, 
the expression \z^d^c{r, z)\ may be upper bounded for all 
2; G and r > 1 by a linear combination (with positive 
coefficients effectively not depending on r since 1/r < 1) of 
terms of the form 


|aA(r, 2 ; + at/r) 6 ^( 2 ; + y/r)| 


(2at)2M(2y) 


2L' 


da; dy, 


(65) 


where a\ G o 5 ^(-,]R^), 6 ^ G o 5 ^(]R^) and M > 2, 2 < L' < L 
(L sufficiently large). Here we have used, possibly repeatedly, 
the fact that when G oS^(]R^), then Ziby^{z + y/r) = 
\y{z + y/r) - {yi/r)bfj,{z + y/r) where dehned by 
bfj_{z) = Zibfj,{z) is again in Replace the numerator of 

the integrand in (l65] l by a constant upper bound and integrate. 
Summing up, we obtain the inequality 

|2:^5“c(r, 2)1 < Ca/g < 00, 2: G (66) 


where the constant Ca/s does not depend on r > 1 . 

Second, we show that Rm G ,5^(-,R^). The integral 
I{t,r,z) between braces {...} in (l62T i is a linear combi¬ 
nation of expressions on the right-hand side of Eq. (l64l i 
after the substitution a(r, 2 ; + x/r) G- {d^p){z -f tx/r) and 
b{z + x/r) G- {d^q){z + x/r), the partial derivatives (of order 
m) coming from those in (l 6 ^ . Now let a, G N§ be arbitrary. 
By the same reasoning as before, we infer that \z^d'^I{t, r, z)\ 
may be upper bounded by a linear combination (with positive 
coefficients effectively not depending on r and t since t/r < 1 ) 
of terms analogous to (l65l l. Taking the supremum of the 
numerators of the integrands, we get rid of the variable t so 
that the integral with respect to t occurring in (l62l l may be 
computed (evaluating to 1). Again summing up, we obtain 
the analog to Ineq. (l 66 T l. where c(r, 2 ;) is to be replaced with 
Rm ^)- 

Now we are in a position to prove Eq. Take any 

function b G Then 


{{p#Tq) #Th){z) 


m—1 


'^^{ak#rb){z) 

+ r~'^^Rm,o{r,z), 




where Rm,o = Rm #rb € ^(*, 1 ^^) and 

m — k — l 

{au #rh){z) = Y, r-^^akj{z) + Rk,m-k{r, z) 

i=o 

with the functions auj G and Rk^m-k G 

fc = 0,..., m — 1. One readily hnds 

m— 1 

{{P#rq)#rb){z) = Y r~‘^'"ak{z)+r~'^'^Rm{r,z), 
k=0 

where the functions d/c G =5^(R^) and Rm G are 

given by 

m 

dk{z) = Y aij{z), Rm{r,z) = YFk,m-k{r,z). 
i+j = k fc = 0 

i,3>0 

Note that do = pqb. Putting q = p and alternately b = p or 
b = p we obtain by induction, observing that brackets may be 
omitted, for the Weyl symbol (TA"{r){z) of A"'(r), 

{Pr// Pr// ■ ■ ■ 4I^Pr// Pr){^) — {p //rP 4/r ■ ■ ■ 4/rP //rP) (^/^) ; 

'- . -^ 

n factors Pr4/Pr 

the asymptotic expansion as given in Eq. (fTsT i [written without 
superscripts ~ again and after the substitution 2 : ^ (a;, 61- 
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